Yield estimation is a powerful tool in vineyard management, as it allows growers to fine-tune practices to optimize yield and quality. However, yield estimation is currently performed using manual sampling, which is time-consuming and imprecise. This study demonstrates the application of proximal imaging combined with deep learning for yield estimation in vineyards. Continuous data collection using a vehicle-mounted sensing kit, combined with collection of ground-truth yield data at harvest using a commercial yield monitor, allowed the generation of a large dataset of 23,581 yield points and 107,933 images. Furthermore, the study was conducted in a mechanically managed commercial vineyard, representing a challenging environment for image analysis but a common set of conditions in the California Central Valley. Three model architectures were tested: object detection, CNN regression, and transformer models. The object detection model was trained on hand-labeled images to localize grape bunches, and either bunch count or pixel area was summed to correlate with grape yield. Conversely, the regression models were trained end-to-end to predict grape yield from image data without the need for hand labeling. Results showed that the transformer and the object detection model with pixel-area processing performed comparably, with mean absolute percent errors of 18% and 18.5%, respectively, on a representative holdout dataset. Saliency mapping was used to demonstrate that the CNN model's attention was focused near the predicted locations of grape bunches as well as on the top of the grapevine canopy. Overall, the study shows the applicability of proximal imaging and deep learning for prediction of grape yield at large scale. Additionally, the end-to-end modeling approach was able to perform comparably to the object detection approach while eliminating the need for hand labeling.
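The pixel-area variant described above can be sketched very simply: sum the detected bunch pixel areas per vine segment, fit a linear model against ground-truth yield, and score with mean absolute percent error. The data below is hypothetical and the helpers are stand-ins for the paper's actual detection and calibration steps.

```python
# Minimal sketch (hypothetical data) of the pixel-area approach: sum detected
# bunch pixel areas per segment, fit a linear model to ground-truth yield,
# and score with mean absolute percent error (MAPE).

def fit_linear(xs, ys):
    """Ordinary least squares for y = a*x + b."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    b = my - a * mx
    return a, b

def mape(y_true, y_pred):
    """Mean absolute percent error, in percent."""
    return 100 * sum(abs(t - p) / t for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical summed bunch pixel areas (per segment) and yields (kg).
areas  = [12_000, 18_500, 9_400, 22_100, 15_800]
yields = [4.1, 6.3, 3.2, 7.5, 5.4]

a, b = fit_linear(areas, yields)
preds = [a * x + b for x in areas]
print(f"MAPE: {mape(yields, preds):.1f}%")
```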
X-ray micro-computed tomography (X-ray microCT) has enabled the characterization of properties and processes that take place in plants and soils at the micrometer scale. Despite the widespread use of this advanced technique, major limitations in both hardware and software limit the speed and accuracy of image processing and data analysis. Recent advances in machine learning, specifically the application of convolutional neural networks to image analysis, have enabled rapid and accurate segmentation of image data. Yet, challenges remain in applying convolutional neural networks to the analysis of environmentally and agriculturally relevant images. Specifically, there is a disconnect between the computer scientists and engineers who build these AI/ML tools and the potential end users in agricultural research, who may be unsure of how to apply these tools in their work. Additionally, the computing resources required for training and applying deep learning models are unique, more common to computer gaming systems or graphics design work than to traditional computational systems. To address these challenges, we developed a modular workflow for applying convolutional neural networks to X-ray microCT images using low-cost resources in Google's Colaboratory web application. Here we present the results of the workflow, illustrating how parameters can be optimized to achieve the best results using example scans of walnut leaves, almond flower buds, and a soil aggregate. We expect this framework to accelerate the adoption and use of emerging deep learning techniques within the plant and soil sciences.
Trait measurement is critical for plant breeding and agricultural production pipelines. Typically, a suite of plant traits is measured using laborious manual measurements, which are then used to train and/or validate higher-throughput trait estimation techniques. Here, we introduce a relatively simple convolutional neural network (CNN) model that accepts multiple sensor inputs and predicts multiple continuous trait outputs -- i.e., a multi-input, multi-output CNN (MIMO-CNN). Further, we introduce deformable convolutional layers into this network architecture (MIMO-DCNN) to enable the model to adaptively adjust its receptive field, model complex variable geometric transformations in the data, and fine-tune the continuous trait outputs. We examine how the MIMO-CNN and MIMO-DCNN models perform on a multi-input (i.e., RGB and depth images), multi-trait output lettuce dataset from the 2021 Autonomous Greenhouse Challenge. Ablation studies were conducted to examine the effect of using single versus multiple inputs, and single versus multiple outputs. The MIMO-DCNN model achieved a normalized mean squared error (NMSE) of 0.068 -- a substantial improvement over the top 2021 leaderboard score of 0.081. Open-source code is provided.
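The multi-input, multi-output structure can be illustrated schematically: each sensor modality is encoded by its own branch, the features are concatenated, and a shared head emits several continuous traits at once. The sketch below is not the authors' code; the dense "encoders" stand in for the convolutional branches, and all shapes and weights are toy values.

```python
import numpy as np

# Schematic sketch (not the authors' code) of the MIMO idea: two sensor
# branches are encoded separately, their features concatenated, and a shared
# head emits several continuous trait outputs at once.

rng = np.random.default_rng(0)

def encoder(x, w):
    """Stand-in for a conv branch: flatten, project, ReLU."""
    return np.maximum(x.reshape(x.shape[0], -1) @ w, 0.0)

batch = 4
rgb   = rng.normal(size=(batch, 8, 8, 3))   # toy RGB input
depth = rng.normal(size=(batch, 8, 8, 1))   # toy depth input

w_rgb   = rng.normal(size=(8 * 8 * 3, 16)) * 0.1
w_depth = rng.normal(size=(8 * 8 * 1, 16)) * 0.1
w_head  = rng.normal(size=(32, 5)) * 0.1    # 5 continuous trait outputs

fused  = np.concatenate([encoder(rgb, w_rgb), encoder(depth, w_depth)], axis=1)
traits = fused @ w_head                      # (batch, 5) trait predictions
print(traits.shape)
```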
The automated synthesis of correct-by-construction Boolean functions from logical specifications is known as the Boolean Functional Synthesis (BFS) problem. BFS has many application areas, ranging from software engineering to circuit design. In this paper, we introduce BNSynth, the first tool to solve the BFS problem under a given bound on the solution space. Bounding the solution space induces the synthesis of smaller functions, which benefits resource-constrained areas such as circuit design. BNSynth uses a counter-example guided, neural approach to solve the bounded BFS problem. Initial results show promise in synthesizing smaller solutions; we observe at least \textbf{3.2X} (and up to \textbf{24X}) improvement in the reduction of solution size on average, as compared to state-of-the-art tools on our benchmarks. BNSynth is available on GitHub under an open source license.
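The counter-example guided, bounded flavor of the problem can be sketched with a crude enumerative stand-in: enumerate candidate functions within a size bound, and accept a candidate only if no counter-example input violates the specification. This is far simpler than BNSynth's neural approach; the `spec` and the truth-table candidate space are toy assumptions.

```python
from itertools import product

# Schematic sketch (not BNSynth itself) of a counter-example guided loop for
# bounded Boolean functional synthesis: enumerate candidates within a bound,
# rejecting each candidate that admits a counter-example input.

def spec(x, y):
    """Toy relational spec: output must equal XOR of the two inputs."""
    return y == (x[0] ^ x[1])

def synthesize(spec, n_inputs=2):
    # Candidate space: all Boolean functions over n_inputs variables, i.e.
    # truth tables -- a crude stand-in for a size-bounded function space.
    inputs = list(product([0, 1], repeat=n_inputs))
    for table in product([0, 1], repeat=len(inputs)):
        candidate = dict(zip(inputs, table))
        # Look for a counter-example; accept only if none exists.
        if all(spec(x, candidate[x]) for x in inputs):
            return candidate
    return None

f = synthesize(spec)
print(f)  # truth table realizing XOR
```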
In recent years, denoising diffusion models have demonstrated outstanding image generation performance. The information on natural images captured by these models is useful for many image reconstruction applications, where the task is to restore a clean image from its degraded observations. In this work, we propose a conditional sampling scheme that exploits the prior learned by diffusion models while retaining agreement with the observations. We then combine it with a novel approach for adapting pretrained diffusion denoising networks to their input. We examine two adaptation strategies: the first uses only the degraded image, while the second, which we advocate, is performed using images that are ``nearest neighbors'' of the degraded image, retrieved from a diverse dataset using an off-the-shelf visual-language model. To evaluate our method, we test it on two state-of-the-art publicly available diffusion models, Stable Diffusion and Guided Diffusion. We show that our proposed `adaptive diffusion for image reconstruction' (ADIR) approach achieves a significant improvement in the super-resolution, deblurring, and text-based editing tasks.
Chest X-ray (CXR) datasets hosted on Kaggle, though useful from a data science competition standpoint, have limited utility in clinical use because of their narrow focus on diagnosing one specific disease. In real-world clinical use, multiple diseases need to be considered since they can co-exist in the same patient. In this work, we demonstrate how federated learning (FL) can be used to make these toy CXR datasets from Kaggle clinically useful. Specifically, we train a single FL classification model (`global`) using two separate CXR datasets -- one annotated for presence of pneumonia and the other for presence of pneumothorax (two common and life-threatening conditions) -- capable of diagnosing both. We compare the performance of the global FL model with models trained separately on both datasets (`baseline`) for two different model architectures. On a standard, naive 3-layer CNN architecture, the global FL model achieved AUROC of 0.84 and 0.81 for pneumonia and pneumothorax, respectively, compared to 0.85 and 0.82, respectively, for both baseline models (p>0.05). Similarly, on a pretrained DenseNet121 architecture, the global FL model achieved AUROC of 0.88 and 0.91 for pneumonia and pneumothorax, respectively, compared to 0.89 and 0.91, respectively, for both baseline models (p>0.05). Our results suggest that FL can be used to create global `meta` models to make toy datasets from Kaggle clinically useful, a step forward towards bridging the gap from bench to bedside.
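The core federated step described above is weight averaging across sites: each site trains locally on its own dataset, and a server aggregates the models into the shared `global` model. The sketch below is schematic (not the paper's training code), with toy scalar "weights" and hypothetical client dataset sizes.

```python
# Schematic sketch (not the paper's training code) of federated averaging:
# each site trains locally on its own CXR dataset, then a server averages
# model weights, weighted by dataset size, into the shared `global` model.

def local_update(weights, grads, lr=0.1):
    """One toy gradient step at a client site."""
    return [w - lr * g for w, g in zip(weights, grads)]

def fed_avg(client_weights, client_sizes):
    """Weighted average of client models, proportional to dataset size."""
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    return [
        sum(cw[i] * sz for cw, sz in zip(client_weights, client_sizes)) / total
        for i in range(n_params)
    ]

global_w = [0.0, 0.0]
# Hypothetical local gradients from a pneumonia site and a pneumothorax site.
site_a = local_update(global_w, grads=[1.0, -2.0])
site_b = local_update(global_w, grads=[3.0, 4.0])
global_w = fed_avg([site_a, site_b], client_sizes=[100, 300])
print(global_w)
```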
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
Business documents come in a variety of structures, formats, and information needs, which makes information extraction a challenging task. Due to these variations, having a generic document model that works well across all types of documents and all use cases seems far-fetched; document-specific models, in turn, require customized document-specific labels. We introduce DoSA (Document Specific Automated Annotations), which helps annotators generate initial annotations automatically using our novel bootstrap approach, by leveraging document-generic datasets and models. These initial annotations can then be reviewed by a human for correctness. An initial document-specific model can be trained and its inference used as feedback for generating more automated annotations, which a human-in-the-loop can again review for correctness; a new, improved model can then be trained using the current model as a pre-trained model before the next iteration. In this paper, our scope is limited to form-like documents due to the limited availability of generic annotated datasets, but this idea can be extended to a variety of other documents as more datasets are built. An open-source, ready-to-use implementation is made available on GitHub at https://github.com/neeleshkshukla/DoSA.
The ever-increasing number of materials science articles makes it hard to infer chemical structure-property relations from published literature. We used natural language processing (NLP) methods to automatically extract material property data from the abstracts of polymer literature. As a component of our pipeline, we trained a materials-domain language model on 2.4 million materials science abstracts; when used as a text encoder, it outperforms other baseline models on three out of five named entity recognition datasets. Using this pipeline, we obtained ~300,000 material property records from ~130,000 abstracts in 60 hours. The extracted data was analyzed for a diverse range of applications, such as fuel cells, supercapacitors, and polymer solar cells, to recover non-trivial insights. The data extracted through our pipeline is made available through a web platform at https://polymerscholar.org, which can be used to conveniently locate the material property data recorded in the abstracts. This work demonstrates the feasibility of an automatic pipeline that starts from published literature and ends with a complete set of extracted material property information.
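The end product of such a pipeline is a structured property record pulled from free text. As a drastically simplified illustration (a regex stand-in for the NER-based extraction described above, with a made-up sentence), a single property mention might be turned into a record like this:

```python
import re

# Illustrative sketch only: a regex stand-in for the NER-based property
# extraction described above, applied to a made-up abstract sentence.
text = "The polymer film showed a glass transition temperature of 105 C."

m = re.search(r"glass transition temperature of ([\d.]+)\s*C", text)
record = {
    "property": "glass transition temperature",
    "value": float(m.group(1)),
    "unit": "C",
}
print(record)
```

A real pipeline replaces the regex with a trained entity recognizer and relation linker, which is what makes it robust across the varied phrasing of ~130,000 abstracts.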
In grids with a significant share of renewable generation, operators will need additional tools to assess operational risk due to the increased variability of load and generation. The computational requirements of the forward uncertainty propagation problem, which must solve numerous security-constrained economic dispatch (SCED) optimizations, are a major barrier to such real-time risk assessment. This paper proposes a Just-In-Time Risk Assessment Learning Framework (JITRALF) as an alternative. JITRALF trains risk surrogates, one for each hour of the day, using machine learning (ML) to predict the quantities needed to estimate risk, without explicitly solving the SCED problem. This significantly mitigates the computational burden of forward uncertainty propagation and allows fast, real-time risk estimation. The paper also proposes a novel, asymmetric loss function and shows that models trained with the asymmetric loss perform better than those trained with symmetric loss functions. JITRALF is evaluated on the French transmission system for assessing the risk of insufficient operating reserves, the risk of load shedding, and the expected operating cost.
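The rationale for an asymmetric loss in risk assessment can be illustrated with a simple weighted squared error: under-predicting a risk quantity (e.g. required reserves) is penalized more heavily than over-predicting it, so the surrogate errs on the safe side. The exact form used in the paper is not reproduced here; the weights below are illustrative.

```python
# Illustrative sketch (not the paper's exact loss) of an asymmetric loss:
# under-prediction of a risk quantity costs more than over-prediction,
# so a trained surrogate errs on the safe side.

def asymmetric_loss(y_true, y_pred, under_weight=4.0, over_weight=1.0):
    """Mean of weighted squared errors; under-predictions cost more."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        err = t - p
        w = under_weight if err > 0 else over_weight  # err > 0: under-prediction
        total += w * err ** 2
    return total / len(y_true)

y_true = [10.0, 10.0]
print(asymmetric_loss(y_true, [9.0, 10.0]))   # under-predicting: 2.0
print(asymmetric_loss(y_true, [11.0, 10.0]))  # over-predicting: 0.5
```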